Upper tails for triangles 
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^ ' Abstract 

Q^ I With ^ the number of triangles in the usual (Erdos-Renyi) random 

CN ' graph G{m,p), p > 1/m and 77 > 0, we show (for some C^ > 0) 



C^ 



(N 
> 



O 



Q> . Pr(^ > (1 + '7)E0 < exp[-a,niin{mVlog(l/p),mV}]- 

CIh' 



This is tight up to the value of C„. 



1 Introduction 

Write G = G{m,p) for the usual (Erdos-Renyi) random graph, and £, = ^(G) 
for the number of triangles in G. The main purpose of this note is to prove 

Theorem 1.1. For any rj > Q and p > m.^-'^lnm, 



^. : pr(e > (1 + ^) ( 3 )/) < p^^^^'p') . (1) 

in 

O \ As perhaps first observed by Vu [13], the bound is tight up to the value 

of the constant in the exponent, since the probability that G contains a 
complete graph on (say) 2mp vertices is expp[0(m^j)^)]. The lower bound 
on p is needed because below this the behavior changes, and a better lower 

^ ' bound on the probability in ([T|) (ignoring the constants in the exponents) is 

j^ ■ the more natural one given by 

Proposition 1.2. For p > 1/m, Pr(^ > 2(^)p^) > exp[— O(m'^p^)], unless 
the probability in question is zero (i.e. unless 2(J!^)p^ > (T)). 
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For p < 1/m the question is uninteresting: one easily has (for -q £ (0, 1), 
say) Pr(^ > (1 + il){^)p^) = Pr(^ > 1) = Q{m^p^), with the lower bound 
given, e.g., by (one of) the Bonferroni Inequalities. Of course Proposition 
11.21 is not news if p > m~^ Inm, but see the discussion in Section [3l 

The proof of Theorem 11.11 extends without serious modification to give 
a full resolution of the question for all p of interest, saying that in all cases 
the larger of the preceding two lower bounds is the truth: 

Theorem 1.3. For any ij > and p 

Pr(^ > (1 +r/)('^)/) < exp[-fi^(min{mVlog(l/p),mV})]- 
(Again, the statement is not interesting for p < 1/m.) 

A result close to Theorem 1 1.1 1 was proved independently by S. Chatterjee 
in [3]. The approaches there and here are related, though the proof given 
here is quite a bit shorter and proves a little more, in that [3j requires 
p > C~^m~^lnm for some small C,j. (Prompted by [3J, a version of the 
present paper proving only Theorem II. H — the proof is essentially the same 
as that given here — was posted at f5].) 

Though only a first case of the analogous question for copies of a general 
(fixed) graph H, the problem addressed by Theorem 11.11 (or Theorem II. 3p 
has a surprisingly substantial history; see e.g. [13l [lOl [TTl [12l |9l H] for 
some of this, or [3] for a thorough account. Here we just mention that 
Kim and Vu J12J were the first to prove that the probability in question is 
exp[— ri^(?7i^p-^)]. (See Section |1] for a little more on general H.) 

For the proof of Theorem 11.31 it's convenient to work with a tripar- 
tite version. Let H be the random tripartite graph on n + n + n vertices; 
thus V = ^(H) is the disjoint union of three n-sets, say Vi,V2,V3, and 
FT{xy E E(H.) = p) whenever x,y are in distinct T^'s, these choices made 
independently. Then with ^' the number of triangles in H (so E^' = n'^p^), 
we show 

Theorem 1.4. For any 6 > and p, 

Pr(^' > (1 + (5)nV) < exp[-J]5(min{nVlog(l/p),nV})]- (2) 

That Theorem 11.41 implies Theorem 11.31 is presumably well-known, but 
we give the easy argument. It is of course enough to prove Theorem 11.31 
when m = 3n. Let r/ be as in Theorem 11.31 and set 6 = 77/(2 + rj). We 
may choose H by choosing G (on V = [3n]) and a uniform equipartition 

1/1 U 1/2 U V3 of V, and setting 

£'(H) = {xy G E{G) : x,y belong to distinct Ws}- 



Of course 

mG] = pCiG), (3) 

where p = n^/( I^) (~ 2/9) and the conditioning event is {G = G}. On the 
other hand, with a{G) = Pr(^' < (1 - 6)p^{G)\G), we have 

E[?'|G] < a(G)(l - 6)paG) + (1 - a(G))e(G), 

whence, using (^, a{G) < 1 — p6/{l — p + pS) =: 1 — (3. Thus (by Theorem 

[HD 

exp[-ri5(min{nV log(l/p), nV})] > Pr(C' > (1 + 5)nV) 
and (noting (1 + 6) /{I — 6) = 1 + rj) Theorem 11.31 fohows. 



Theorem 11.41 is proved in Section [2l There is more than one way to 
prove it, but we just give the easiest (that we know). Section [3] gives the 
easy (and probably known) proof of Proposition 11.21 ^^^ ^^ Section [H we 
briefly mention the situation for general H. As noted there, the present 
method will surely prove more than Theorem 11.31 but at this writing we 
don't know exactly how much more. 

2 Proof of Theorem 11.41 

We rename the parts of our tripartition A, B, C and always take a, 6, c to be 
elements of A, B, C respectively. A triangle is then simply denoted abc. The 
set of triangles of H is denoted T. As usual Ny{x) = {y £ Y : xy G £'(H)} 
and dyix) = |A^y(x)|, and we also use (e.g.) d{a) = niax{dBia),dc{a)} and 
d{a,b) = \Nc{a) n Nc{b)\. For disjoint X,Y C V we use V(X) (resp. 
V(X, Y)) for the set of edges with one end in X (resp. one end in each of 
X,Y). 

Set t = ln{l/p), s = m.m{t,np}, a = 5/3, e = .02a and (say) 7 = 1/e. 
We may assume 

p<{l + 5)-^/\ (4) 

since otherwise the left side of ([2]) is zero. We may also assume: 5 — so also 
£ — is (fixed but) small (since ([2]) becomes weaker as 5 grows); given 5, n is 
large (formally, n > n^); and, say, 

p > e^^n"-^ (5) 



(since for smaller p, Theorem 11.41 becomes trivial with an appropriate il-s)- 
We say an event occurs with large probability (w.l.p.) if its probability 
is at least 1 — expp[Te'*n^p^] for some fixed T > and small enough e (and 
p satisfying ([5])), and write "a <* /3" for "w.l.p. a < /3." Note that an 
intersection of 0{n) events that hold w.l.p. also holds w.l.p. 

Let A' = {a: dc{a) < np^'^} and B' = {b : dc{b) < np'^-^}. The next 
three assertions imply Theorem 11.41 

w.Lp. \{abc eT -.a^A' orb^B']\< an^p^; (6) 

w.l.p. \{abc £T -.aG A',b£ B', d{a, h) > 10np/s}| < an^p^] (7) 
Pr(|{a6c G T : d{a,b) < 10np/s}\ > (1 + a)nV) < exp[-f]5(nVs)]- (8) 

We use B{m,a) for a random variable with the binomial distribution 
Bin(?7z, a). For the proofs of ([6])-(l8]) we need the following standard bounds; 
see e.g. [TJ Theorem A. 1.12], [8, Theorem 2.1(a)] and [2, Lemma 8.2]. 

Lemma 2.1. There is a fixed C > so that for any K > 1 + e, m and a, 

Fr{B{m, a) > Kma) < minKe/K)^™", exp[-Ce^i<:ma]}. (9) 

(Though unnecessary, the K in the second expression will be helpful below.) 
When m = n and a = p we use qx for the r.h.s. of ([9]). 

Lemma 2.2. Suppose wi, . . . ,Wm G [O,^;]. Let ^i, . . . ,^m be independent 
Bernoullis, ^ = Y1 ^i'^ij o.'^d E^ = p. Then for any r] > and A > rjn, 

Pr(C > /i + A) < exp[-S7^(A/2;)]. 

Lemma 2.3. For K > 1 + e and X £ {A, B, C}, 

. , * I 3n • eK~^ if qk > n^"^ , , 

|{xeX:d(x)>i^np}|<*r;,:= ,2,^, ' ^^ . (10) 

I K\aK otherwise. 

The first, ad hoc value of r^ is for use in the proof of ([8]). Note that 

e'^npt ( 2enpt/K if i^ > 1 + e 
KlnK ^ { enp/K if K > p-^. ^ ' 

Proof of CoroUarv \2.3[ Write A^ for the left side of ([TT]) and let Qk = Qt^k = 
r and (w.l.o.g) X = A. If g < n~^ then, since the dsiays and dc'(a)'s are 



independent copies of B{n,p), two applications of Lemma \27\] (and a little 
checking) give 



Pr(iV > r) < Fr{B{2n,q) > H) < (2e^)'" < exp[-f^(eVp^t)]. 

If g > n~^ then exp[— Ce^-fiTnp] > q implies Knp < 2C^^e~^lnn, while 
([5]) gives g < exjp[—Ce'^Knp\ < exTpl—Cs'^K] < eK~'^ (the last inequality 
gotten by observing that explCe^^KjeK^^ is minimized at K = Ae/C and 
assuming, as we may, that e < (Ce/4)'^''^). It follows that 

Pv{N >r)< Pr(5(2n,g) > r) < exp[-ri(eni^"'^)] < pf^("V)^ 

where the second inequality uses r > 3nq/2 (and Lemma |2.1|) and the (very 
crude) third inequality uses the above upper bound on Knp. 



We will also make occasional use of the fact that for any /3 > and p, 

/ln(l/p) < (e/3)"^ and /ln2(l/p) < 4(e/3)~^ (12) 

Proof of Q. For K > p-^ {> 1 + e; see (^) Lemma O (with ([U])) gives 
\{a : d{a) > Knp}\ <* enp/K (note K > p~^ implies qx < n^'^), and 
similarly with b in place of a. On the other hand, with Ka = d{a)/{np), 

w.l.p. |V(iVij(a),iVc(a))| <max{2i^>V,f^Vi}=:/3a Va (13) 

(and similarly for 6), since, given any V(A), the probability that the event 
in (J13p fails is (again using Lemma l2.ip less than 

^Pr(i?(K2^V,p) > /3a) < Y,exp[-n{f3a)] < exp[-fi(nVt)]. 

o a 

This gives ([6]) since, with J = y/t/{2p) (so /?„ = 2K^n'^p^ iff K„ > J) and 
u = [— log2( Jp)J , \{abc G T : a A'}| is at most 

u 

\{abc eT:p-^ <Ka<J}\+Y, l{«^c € T : K^ € [2V, 2*+V]}| 

i=0 

<* enpi+ Vp^t + ^ £^ . 2 . 2^*+ W/ < iJenY 

j=0 
(using (fT^ ). and, of course, similarly for \{abc G T : 6 B'}\. 



Proof of dH). FoT K > J := 10/s, let Ak = {a : 3b e B', d{a,b) > 
Knp}, and define Bk similarly. Given V{B,C) the events {o G Ak} are 
independent with, for each a, 

Pv{a £ Ak) < nFr{B{np^^^,p) > Knp) < np^"'^^'^ =: q, 

using Lemma l2.ll (with ep^^'^ / K < p^''^, which follows from ()12p ) for the 
second inequality. Now Knpt > lOmaxjnp, t} > 71nn (say) implies both 
enq^'"^ < 1 and q < p^^P'"^^ so we have (again using Lemma l2.ip 



Prd^i^l > enp/K) < {enq/\enp/K']Y''Pl^ < (gi/2)£np/i^ < exp[-en^ph/8]. 

Thus \Ak\ <* enp/K, and similarly for Bk- 

Now thinking of first choosing V(C) (which determines the Ak^s and 
Bk's), we have |^j|,|i?j| <* enps, so that E\V{Aj,Bj)\ <* e^s^n^p^. 
Lemma |2. II (using, say, es^p < p^'"^, which follows from (I12p ). then gives 

\V{Aj,Bj)\<*enV- 
We may then bound the left side of ([7]) by 

\V{Aj, Bj)\np + Y^ 2^^V2nV <* (e + ^e^Wp^, 

i>0 

where the first term corresponds to aftc's with d{a,b) E [Jnp,np], and the 
ith. summand to those with d{a,b) G [2^np,2^~^^np] (using \{abc G T : a G 
A',bG B',d{a,b) G [Knp,2Knp]}\ < \AK\\BK\2Knp <* 2e'^n^p^/K). 



Proof of ([8]). We first show 

Y{d^{c) : d{c) > (1 + s)np} <* 40enV- (14) 

Setting V = [— log2((l + £)p)\, u = [— log2((l + e)p^)\, and using Lemma 
] (with (jlip ) , we have 

Vlr/2M • . '^(^.) fz [9* 2*+in <* / ^^(1 + e)^V2' if i > n 



provided K{i) := (1 + e)2* satisfies qK{i) < n ^. The left side of p^ is thus 
w.l.p. at most 

U V 

3enV Yl 2"^'^^ + ^^(1 + ^)^V[2t Yj'^' + Yj ^1 < 40enV, 

i>0 i=0 i=u+l 

where the first term on the left, covering c's with jtt-t — S [2\2*'*"^] for an 
i with Qku) > n~^, again comes from Lemma 12.31 and we used (I12p to say 
(say) p^~'^t < p^. 

Finally, set Cab = l{a6G£{H)}- We have 

^{d(a, b) : d{a, b) < Wnp/s} < ^ (f{c) <* (1 + 42e + e'^)n^p'^ (15) 
(by (fH|) . where "<*" refers to the choice of V(C)), and 

\{abc e T : d{a,b) < 10np/s}\ = ^{Cafed(o,6) : d{a,b) < lOnp/s}, 
so that Lemma 12.21 (with z = lOnp/s) combined with (J15p gives 

Pr(|{a6c G T : d{a,b) < 10np/s}| > (1 + Q)nV) < exp[-ila(nVs)]- 



3 Lower bound for small p 

Here we prove Proposition [L2l As noted in Section [H we only need to prove 
it for p < m~^ Inm; but we will show a little more: 

Proposition 3.1. For l/m < p < o{m~^'^), 

Pr(G contains 2{^S\p^ disjoint triangles} > exp[— 0(?tt, p )]. 

The point is that the natural bound of Proposition 11.21 is achieved, at least 
for p as in Proposition 13.11 more "generically" than the usually stronger 
exp„[0(r7T,^p^)]. There are similar statements for larger p — for example, for 
l/m < p < o(?7i~^'^) Proposition 13.11 is true with "disjoint" replaced by 
"edge-disjoint" — but we will not pursue this tangent further here. 

Proof. Write T = 7^ for the set of triangles of Km and set M = [2(™)y|3] . 
The number of sets S consisting of M vertex-disjoint members of T is 

M-l 

^. U (""-.'')- Mir (16) 



(where "~" uses p < o(m~^/^)). For such S let Qs and Rs be the events 
{G contains all triangles of S} and {S is the set of triangles of G}; let a = 
3M{m — 3) be the number of members of T that share a (necessarily unique) 
edge with the union of the triangles of S, and set b = CI}) — M — a. Then 

Pr{Rs) = Pt{Qs) PiiRslQs) > P^'H^ - /)"(! - P')\ (17) 

where the inequality is given by Harris' Inequality [7] (which for our purposes 
says that for a product probability measure fj, on {0, 1} (with E a finite 
set) and decreasing events Ai C {0, 1}^, one has fi{r\Ai) > n/^(-^i))- Thus 
(except for a factor (1 — o(l))) the probability that G contains exactly M 
triangles and no two of these share a vertex is at least the product of the 
right sides of (fT6]) and pT|) . which is easily seen to be ex.p[—0{m^p^)]. 



4 Remarks 

For a fixed graph H write ^ = S,H{n,p) for the number of (unlabeled, say) 
copies of H in G{n,p) and set fi = //i^(n,p) = E^. A beautiful result of 
Janson, Oleszkiewicz and Rucihski [9] says 

Pr(^ > (1 + r])fi) < eM-^H,rjiM{H, n,p))], (18) 

where we omit the definition of the parameter M (understanding of which 
is one of the main concerns of [9j), but just mention that (i) if not zero, 
the probability in (fTHj) is bounded below by expp[0//^^(M(i7, n,p))], and (ii) 
except in the uninteresting case p < n~^'^^'^^', M{Kr,n,p) is Q{n'^p'^'^^). In 
particular p^ includes the result of [12] mentioned in Section [1] (and a bit 
more, since [12] requires p > n~^ logn). 

Of course (fTH]) is sharp when H = K2, but we guess, perhaps optimisti- 
cally, that this is the only case where the analogue of Theorem 11.11 fails; 
more precisely, for any connected H ^ K2 and p not too small, we should 
have 

Pr{C{H,n,p) > {l + r])fiH{n,p)) < p^HA^HH,n,p)) _ ^^g^ 

At this writing we think we can at least push the present argument to prove 
()19p for complete graphs. We won't try to say here what the lower bound 
on p should be in general. For Kr it should be n~^' '^''^^'(logn)^' I^'"^-'^-'''"^^)], 
which is (essentially) where the lower bound in (i) above becomes larger 
than the bound in (flQl). 



Added in proof. We have now completely settled the problem for cliques 
and believe we know what should happen for general H and p; these items 
will appear in [6]. 

Acknowledgment. We would like to thank one of the referees for an 
exceptionally careful reading. 
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